Snac support #3075

maximizemaxwell · 2025-09-06T13:06:12Z

Part of issue #3057

What does this PR do?

Implement SNAC features and integration.

Summary

Implement SNAC (Multi-Scale Neural Audio Codec) integration for Text-to-Speech applications
Add comprehensive TTS utilities and configuration presets for speech synthesis
Provide an example demonstrating the Qwen + SNAC TTS pipeline

Changes Made

New module: snac_tts_integration.rs with TTS-optimized SNAC codec wrapper
Enhanced SNAC model: Added TTS-specific methods (encode_for_tts, decode_from_tts_tokens, batch processing)
Config presets: Added default_tts(), high_quality_tts(), fast_tts() configurations
Utility functions: Memory estimation, token validation, voice embedding creation
Example implementation: qwen_snac_tts_example.rs showing complete TTS pipeline

Key Features

Multiple quality presets: 24 kHz speech, 32 kHz general, fast 16 kHz options
TTS pipeline abstraction: SnacTtsPipeline for easy integration with language models
Batch processing support: Efficient handling of multiple audio streams
Memory optimization: Token padding, truncation, and memory estimation utilities
Voice cloning support: Reference audio embedding extraction
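
To illustrate the token padding, truncation, and memory estimation mentioned above, here is a minimal standalone sketch. It is not the code from this PR: the function names (pad_or_truncate, pad_batch, estimate_token_bytes), the pad_id parameter, and the assumption that tokens are stored as u32 are all hypothetical and chosen only for illustration.

```rust
/// Hypothetical helper (not taken from the PR): force a token sequence to
/// exactly `target_len` entries, truncating extra tokens or appending `pad_id`.
fn pad_or_truncate(tokens: &[u32], target_len: usize, pad_id: u32) -> Vec<u32> {
    // Keep at most `target_len` tokens from the front of the sequence.
    let mut out: Vec<u32> = tokens.iter().copied().take(target_len).collect();
    // If the sequence was shorter, fill the remainder with the padding id.
    out.resize(target_len, pad_id);
    out
}

/// Hypothetical helper: pad every stream in a batch to the length of the
/// longest stream so the batch can be stacked into one rectangular tensor.
fn pad_batch(batch: &[Vec<u32>], pad_id: u32) -> Vec<Vec<u32>> {
    let max_len = batch.iter().map(Vec::len).max().unwrap_or(0);
    batch
        .iter()
        .map(|tokens| pad_or_truncate(tokens, max_len, pad_id))
        .collect()
}

/// Hypothetical helper: bytes needed to hold a padded batch of tokens,
/// assuming each token is stored as a u32. This counts token storage only,
/// not model weights or activations.
fn estimate_token_bytes(batch_size: usize, seq_len: usize) -> usize {
    batch_size * seq_len * std::mem::size_of::<u32>()
}
```

Padding to the longest stream keeps batches rectangular at the cost of some wasted storage, which is why a rough estimate such as estimate_token_bytes can be useful before allocating.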

lucasjinreal · 2025-09-07T05:33:07Z

Hi, is this about to work? It seems we can support SparkTTS and VovyTTs once this is workable.

lucasjinreal · 2025-09-18T05:33:16Z

@maximizemaxwell Hi, would you like to add some checkpoint conversion docs? I'd like to verify whether the results are normal. Once that is done, we can consider merging SNAC support and enabling several SOTA TTS models that use SNAC.

maximizemaxwell added 2 commits September 6, 2025 22:01

feat: implement snac (5490330)

add examples (c8d5a59)

maximizemaxwell marked this pull request as draft September 6, 2025 13:06

fix: fix errors (c2893f4)

maximizemaxwell marked this pull request as ready for review September 16, 2025 11:34
